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Abstract 

Single-stranded DNA binding (SSB) proteins are essential for all DNA-dependent cellular processes. Typical SSB proteins 
have an N-terminal Oligonucleotide-Binding (OB) fold, a Proline/Glycine rich region, followed by a C-terminal acidic tail. In 
the genome of the heterocystous nitrogen-fixing cyanobacterium, Anabaena sp. strain PCC7120, alr0088 and alr7579 are 
annotated as coding for SSB, but are truncated and have only the OB-fold. In silico analysis of whole genome of Anabaena 
sp. strain PCC7120 revealed the presence of another ORF 'all4779', annotated as a hypothetical protein, but having an N- 
terminal OB-fold, a P/G-rich region and a C-terminal acidic tail. Biochemical characterisation of all three purified recombinant 
proteins revealed that they exist either as monomer or dimer and bind ssDNA, but differently. The AII4779 bound ssDNA in 
two binding modes i.e. (AII4779) 35 and (AII4779) 66 depending on salt concentration and with a binding affinity similar to 
that of Escherichia coli SSB. On the other hand, Alr0088 bound in a single binding mode of 50-mer and Alr7579 only to large 
stretches of ssDNA, suggesting that AII4779, in all likelihood, is the major typical bacterial SSB in Anabaena. Overexpression 
of AII4779 in Anabaena sp. strain PCC7120 led to enhancement of tolerance to DNA-damaging stresses, such as y-rays, UV- 
irradiation, desiccation and mitomycinC exposure. The tolerance appears to be a consequence of reduced DNA damage or 
efficient DNA repair due to increased availability of AII4779. The ORF all4779 is proposed to be re-annotated as Anabaena 
ssb gene. 
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Introduction 

Single-stranded DNA-binding proteins (SSB) are ubiquitous 
proteins found in all bacteria. The SSB proteins are characterised 
by their non-specific binding to single-stranded DNA (ssDNA) and 
active participation in the maintenance of genome integrity (DNA 
repair) as well as genetic information transfer (replication and 
transcription) [1]. A typical SSB monomer consists of (a) an N- 
terminal region having several conserved residues responsible for 
binding to ssDNA, tetramerisation and stabilization of monomer 
fold [2], and (b) a C-terminal region which displays a low sequence 
conservation except for the last few amino acids (known as acidic 
tail), and is responsible for protein-protein interactions and 
recruitment of DNA interactive proteins [3] . The highly conserved 
OB-fold has been extensively described for Escherichia coli SSB 
protein [1]. Deletion of C-terminus diminishes recruitment of 
other DNA interacting proteins [3] , but enhances the affinity of N- 
terminus to ssDNA [4] . The spacer region between the N-terminal 
OB-fold and C-terminal acidic tail is rich in proline/ glycine (P/ G) 
residues and is thought to modulate the strength of DNA binding, 
possibly by distancing the highly negatively charged C-terminal 
end from the positively charged DNA binding N-terminal domain 
[5]. 



Nitrogen-fixing cyanobacteria, such as strains of Anabaena and 
JVostoc exhibit tolerance to a variety of abiotic stresses such as 
salinity, desiccation, heat and radiation [6,7], strongly indicative of 
a robust mechanism of DNA repair in these microbes [8]. 
Unfortunately, genes/proteins involved in DNA metabolism of 
cyanobacteria have not received adequate attention. The SSB 
protein is a key protein involved in all DNA related cellular 
activities. In the genomic database of Anabaena sp. strain PCC7120 
(hereafter referred to as Anabaena 7120), two ORFs 'alr0088' and 
'alr7579' are annotated as coding for SSB-like proteins (http:// 
genome.microbedb.jp/cyanobase/Anabaena) and exhibit 28— 
30% homology at amino acid level with EcoSSB, and about 
42% homologous to each other. However, BLAST search [9] of 
the amino acid sequence of these two proteins show that the 
protein sequence is terminated immediately after the N-terminal 
OB-fold region and have no region corresponding to either the P/ 
G spacer or the C-terminal acidic tail. Since, C-terminal acidic tail 
is essential for interaction with other DNA replication/repair/ 
recombination proteins [3], it seems unlikely that the proteins 
encoded by these two ORFs can perform all the functions of SSB 
proteins. However, this does not rule out that they are genuine 
SSBs, since the second SSB (SsbB) of naturally transformable 
bacteria, such as Bacillus subtilis, lacks the C-terminal acidic tail, 
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Table 1. Primers used and PCR amplicons generated in this study. 



Primers 


Nucleotide Sequence* 


R.E. 


Amplicon* 


o/r008SFwd 


5' GGCCATATGAGCATTAACATTGTC 3' 


Nde\ 


alr0088 ORF (0.35 kb) 


alr0088 Rev 


5' GGCGGATCCTTAAAAATTTTCTGGTGC 3' 


6am HI 




o/r7J79Fwd 


5' GGCCATATGAACTATATCAACAAA 3' 


Nde\ 


alr7S79 ORF (0.38 kb) 


alr7579 Rev 


5' GGCGGATCCCTAGAAATTTGCGTTAGC 3' 


6am HI 




o//4779Fwd 


5' GGCCATATGAACAGCTGTGTTTTA 3' 


Nde\ 


all4779 ORF (0.55 kb) 


all4779 Rev 


5' GGCGGATCCTAAAATGGAATATCGTC 3' 


6am HI 





*The restriction endonuclease (R.E.) site included in each primer is underlined and the corresponding R.E. indicated in the adjacent column. 
# The amplicons generated with a given set of PCR primers are specified. 
doi:1 0.1 371 /journal.pone.0093592.t001 



but functions as a SSB and is involved in genetic recombination 
[10,11]. Alr0088 and Alr7579 exhibit about 36-38% overall 
homology with BsSsbB. An in silica search for a SSB-like protein 
with the C-terminal region (i.e. the spacer region and acidic tail) in 
the genome of Anabaena 7120, revealed the ORF 'all4779'. The 
ORF encodes a 182 amino acid long protein with an N-terminal 
OB-fold, a P/G rich spacer region and a C-terminal acidic tail but 
has been annotated as a hypothetical protein possibly due to its 
lower homology (15-18%) with other bacterial SSBs. The 
homologs of the two truncated SSB-like proteins as well as 
hypothetical SSB-like protein of Anabaena 7120 are found across all 
cyanobacterial genomes (http://genome.microbedb.jp/ 
cyanobase). 

In the present work, we cloned, overexpressed, purified and 
biochemically characterised Alr0088, Alr7579 and A114779 
proteins. All three proteins existed in monomeric and dimeric 
forms and showed differential binding to ssDNA. A114779 protein 
showed typical structural features, oligomerisation, ssDNA binding 
modes compared to E. coli SSB and conferred tolerance to DNA 
damage, upon overexpression in Anabaena, that identifies it as the 
major SSB of Anabaena. 

Materials and Methods 

Organism and Growth Conditions 

E. coli cells were grown in Luria-Bertani (LB) medium at 37°C 
with shaking (150 rpm). When required antibiotics [34 ug 
chloramphenicol mL '(Cm^), 50 (ig kanamycin mL 1 (Kan 50 ), 
or 100 (ig carbenicillin mL -1 (Cbioo)] were used in culture media. 
Axenic cultures of Anabaena 7 1 20 were grown in BG- 1 1 liquid 



medium, pH 7.0 [12] in the absence of combined nitrogen (BG- 
1 1, N~) under stationary conditions with continuous illumination 
(30 |iE m 2 s ) at 27°C±2°C. Recombinant Anabaena strains 
were grown in the presence of 10 (ig neomycin mL 1 (Nmio) in 
BG-1 1 liquid media or with 25 ug neomycin mL -1 (Nmgs) on BG- 
1 1 agar plates. Growth was assessed in terms of chlorophyll a 
content per ml of culture as described earlier [13]. Cell survival 
was assessed in terms of colony forming units (cfu) by plating 
100 ul of the culture on to BG-11, N~ agar plates followed by 
incubation under illumination for 10 days as described earlier 
[14]. 

Three-day-old nitrogen-fixing Anabaena cultures were concen- 
trated to a chlorophyll a density of 10 ug mL , prior to subjecting 
them to one of the following stresses: (i) 6 kGy of Co y-rays at a 
dose rate of 4.5 kGy h , or (ii) 6 days of desiccation or in humid 
chamber (control), or (iii) 0-4 (Ig mitomycinC (mitC) mL 1 for 
30 min, or (iv) exposure to 0-1.5 kj UV-B (280 nm) (dose rate 5 J 
m 2 sec ) for different duration. Survival in response to stress, 
and post-stress recovery were compared with unstressed/control 
cultures grown under illumination at 27°C±2°C as described 
earlier [14]. 

Generation of Plasmid Constructs for Overexpression of 
Proteins in E. coli 

Different amplicons (alr()088, alr7579, all4779) were generated 
by PCR amplification of Anabaena 7120 genomic DNA (100 ng) 
using gene specific primers (as indicated in Table 1), 1 uM dNTPs 
and 1U Taq DNA polymerase in Taq buffer (Bangalore Genei, 
India). These amplicons were individually digested with Ndel and 
BamYLI restriction endonucleases and ligated to the expression 



Table 2. Plasmids used in this study. 




Plasmids 


Description 


Source/Reference 


pET16b 


Cb r , expression vector 


Novagen 


pAM1956 


Kan r , promoterless vector with gfpmut2 reporter gene 


[19] 


pBluescript (pBS) 


Cb r , cloning vector 


Lab Collection 


pFPN 


Cb r , Kan r , integrative expression vector 


[18] 


pETalr0088 


Cb', 0.35 kb alr0088 gene cloned in pET16b at Nde\IBamH\ restriction sites 


This study 


pETalr7579 


Cb r , 0.38 kb alr7579 gene cloned in pET16b at Nde\IBamH\ restriction sites 


This study 


pETall4779 


Cb', 0.55 kb all4779 gene cloned in pET16b at Nde\IBamH\ restriction sites 


This study 


pFPNall4779 


Cb', Kan', 0.55 kb all4779 gene cloned in pFPN at Nde\IBamH\ restriction sites 


This study 


pAMall4779 


Kan r , 0.81 kb Xma\-Sal\ fragment from pFPNa//4779 gene cloned in pAM1956 


This study 


doi:1 0.1 371 /journal.pone.0093592.t002 
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Table 3. Bacterial strains used in this study. 





Bacterial Strains 


Description 


Source/ 
Reference 


£ coli strains 


DH5a 


F recA41 endAI gyrA96 fW-1/7sc/R17 (rk mk ) sup£44 relA X A/acU169 


Lab Collection 


HB101 


F~mc r Bm r rhsdS20(r B _ m B _ ) recAU leuB6 ara-14 proA2 lacYl galK2 xyl-5 mtl-1 rps (Sm R ) g/nV44 X" 


Lab Collection 


BL21(pLysS) 


Cm r ?~ompT hSdSB (r B ~m B ~) gal dcm pLysS (pLysS) (DE3) 


Novagen 


BL21 (pLysS)(pETo/r00S8) 


Cm r , Cb r , £ coli BL-21 cells harbouring the plasmid, pETalr0088 


This study 


BL21 (pLysS)(pETo/r7J79) 


Cm r , Cb r , £ coli BL-21 cells harbouring the plasmid, p£Jalr7579 


This study 


BL21(pLysS)(pETo//4779) 


Cm', Cb r , £ coli BL-21 cells harbouring the plasmid, pETall4779 


This study 


Ec(pAMo//4779) 


Kan', HB101 harbouring pAMo//4779 plasmid 


This study 


HB101 (pRL623+ pRL443) 


Donor strain carrying pRL623 (encoding methylase) and conjugal plasmid pRL443 


(Wolk, CP.) 


Anabaena strains 


Anabaena 7120 


Wild type strain 


Lab Collection 


AnpAM 


Nm r , Anabaena 7120 harbouring the plasmid, pAM1956 


[26] 


Ana//4779 + 


Nm r , Anabaena 7120 harbouring the plasmid, pAMall4779 


This study 



doi:1 0.1 371 /journal.pone.0093592.t003 
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Figure 1. Bioinformatic analysis of Alr0088, Alr7579 and AII4779 proteins of Anabaena 7120. (A) Conserved Domain Database (CDD) 
analysis of Anabaena Alr0088, Alr7579 and AII4779 proteins and £ coli SSB (EcoSSB) protein. The OB-fold corresponding ssDNA binding region, dimer 
and tetramer interfaces for all the proteins are indicated. (B) Comparison of homology between predicted amino acid sequence of AII4779 and 
EcoSSB. The identical amino acids are indicated by letters and similar amino acids with a '+' sign. The proline (P) and glycine (G) residues beyond the 
OB fold are shown in larger font, while the acidic residues at the C-terminal end are in bold and underlined. The numbers on the left and right hand 
side correspond to amino acid residues. 
doi:10.1371/journal.pone.0093592.g001 
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Elution volume (mL) 



Figure 2. Molecular mass determination of purified native Anabaena proteins. (A) Ni-NTA affinity chromatography purified Alr0088, Alr7579 
and AII4779 proteins separated on 12% SDS-polyacrylamide gel followed by staining with Coomassie Brilliant Blue (CBB) G-250. The purified proteins 
are indicated by arrows. (B) Elution profile of purified native Alr0088, Alr7579 and AII4779 proteins in gel filtration chromatography using Superdex 
HR200 matrix. A standard graph using the following standard proteins: [Myosin (200 kDa), p-galactosidase (116 kDa), Phosphorylase-b (97.4 kDa), 
Bovine Serum Albumin (66 kDa), Chicken Albumin (45 kDa), Carbonic Anhydrase (29 kDa) and RNaseA (13.7 kDa)] was drawn to calculate the 
molecular mass of the eluted native Anabaena proteins depending on their elution volume. The position of the eluted proteins has been depicted by 
'star' and 'triangle' symbols. The vertical and horizontal lines from the two symbols indicate the elution volume and the corresponding log of 
molecular mass. 

doi:1 0.1 371 /journal.pone.0093592.g002 



vector pET16b (Table 2), having Hisi 0 -tag at the 5' end, at 
identical restriction sites. The resulting plasmid constructs were 
designated as pETalrO088, pETalr7579 and pETall4779 respec- 
tively (Table 2). DNA insert of all three plasmids were sequenced 
on both strands using Sanger's dideoxy method and were found to 
be completely identical to the corresponding nucleotide sequences 
available in the genomic database. The nucleotide sequences 
corresponding to alr0088, alr7579 and all4779 respectively were 
submitted to GenBank (GenBank Accession Nos. GU225949, 
GU225950, GU225951). 

Overexpression and Purification of His-tagged Proteins 

The plasmids pETalr0088, pETalr7579 and pETall4779 
(Table 2) were transformed into E. coli BL21(pLysS) and 
transformants selected on LBCm 34 Cbioo plates (Table 3). Proteins 
were overexpressed from the respective logarithmic phase cultures 
of E. coli upon addition of 1 mM IPTG for 3 h at 37°C. The 
recombinant His-tagged proteins (Alr0088, Alr7579 and A114779) 
were extracted in lysis buffer (20 mM Tris-HCl, pH 8, 0.5 M 
NaCl, 5 mM imidazole and 0.1% TritonX-100) by sonication and 
purified by Ni-NTA affinity chromatography (Qiagen, Germany) 
using different concentrations of imidazole ranging from 10— 
1000 mM as described earlier. The proteins were eluted 
individually in 1 M imidazole fraction and visualised by 
electrophoretic separation on 14% SDS-polyacrylamide gel 
followed by staining with Coomassie Brilliant Blue (CBB) G-250. 
The proteins were quantified spectrophotometricalfy by measuring 
absorbance at 280 nm using 19480 M" 1 cm" 1 , 20970 M" 1 cm" 1 
and 12950 M 1 cm 1 as the extinction coefficients, estimated 
using Expasy software (web.expa.sj.org/protparam) for Alr0088, 
Alr7579 and A114779 proteins respectively. 



The purified recombinant native proteins were individually 
cross-linked using glutaraldehyde, as described earlier [15], 
followed by precipitation of protein with cold acetone. The pellet 
was air dried, solubilised in IX Laemmli's buffer by heating at 
80°C for 10 min, separated by SDS-PAGE and visualized by 
staining with CBB G-250. 

Molecular mass determination of native purified proteins was 
carried out by gel filtration chromatography using Superdex 
HR200 column. The coumn was equilibrated with Tris-NaCl 
buffer and standard graph obtained using the following proteins: 
Myosin (200 kDa), fi-galactosidase (116 kDa), Phosphorylase-b 
(97.4 kDa), Bovine Serum Albumin (66 kDa), Chicken Albumin 
(45 kDa), Carbonic Anhydrase (29 kDa) and RNase A (13.7 kDa). 
Molecular mass of the three Anabaena proteins was calculated from 
the standard graph on the basis of the elution volume. Presence of 
protein in different eluates/fractions was detected by measuring 
absorbance at 280 nm. 

Electrophoretic Mobility Shift Assay (EMSA) 

A 75-mer oligonucleotide (10 ng) was end-labelled with y- 32 P- 
ATP using Polynucleotide Kinase. The labelled oligo was 
incubated in the presence of specified concentrations of the 
Alr0088, Alr7579 and A114779 proteins in binding buffer [20 mM 
Tris-HCl, pH 8, 1 mM MgCl 2 , 100 mM KC1, 8 mM Dithiothre- 
itol (DTT), 4% sucrose, 80 ug mL" 1 Bovine Serum Albumin 
(BSA)] for 30 min at room temperature and electrophoretically 
separated subsequendy on 6% non-denaturing polyacrylamide gel 
at 150 V for 2 h in IX Tris-borate EDTA (TBE) buffer. Imaging 
of the radioactive gel was carried out using Phosphorimager 
Typhoon Trio Variable mode imager (Wipro-GE-HealthCare, 
USA). 
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Figure 3. Glutarldehyde (Glh)-aided crosslinking of native purified Anabaena SSB-like proteins and their binding to ssDNA. The 

purified native Anabaena proteins (A) Alr0088, (B) Alr7579 and (C) AII4779 were cross-linked with Glh in the presence or absence of M13 ssDNA as 
indicated. The proteins were electrophoretically separated on 12% SDS-polyacrylamide gel followed by staining with Coomassie Brilliant Blue (CBB) G- 
250. The molecular mass of the protein markers used (M1 and M2) are written to the immediate (right/left) of the marker lane. Different molecular 
forms of the native Anabaena proteins are indicated by the arrows. (D) Electrophoretic Mobility Shift Assay (EMSA) of a y- 32 P-ATP labeled 75-mer 
oligonucleotide in the presence of different concentrations of Alr0088, Alr7579 and AII4779 proteins. Following in solution interaction, the assay mix 
was separated by 6% non-denaturing PAGE in 1X TBE and radioactive gel imaged using a phosphorimager. The free ssDNA substrate and the 
different ssDNA-protein complexes formed are indicated. 
doi:1 0.1 371 /journal.pone.0093592.g003 



Fluorescence Measurements 

All three proteins showed maximum excitation at 282 nm. The 
emission maxima were found to be 310, 340 and 335 nm 
respectively for Alr0088, Alr7579 and A114779 proteins in 
20 mM Tris-HCl, pH 8, 1 mM EDTA buffer. The change in 
the intensity of the emitted fluorescence was measured in the 
presence of increasing concentration of ssDNA [poly(dT) or M 1 3 
ssDNA] . The graph of relative fluorescence (%) as a function of 
poly(dT) concentration was used to determine the binding 
constant for the individual proteins to ssDNA as described earlier 
[16]. The binding constants were calculated as the reciprocal of 
the concentration of poly(dT) at which 50% fluorescence 
compared to the initial 100% was detected. The graph depicting 
quenching expressed as the ratio of difference in fluorescence to 
initial fluorescence (AF/Fj) as a function of the ratio of 
concentrations of poly(dT) and protein was used to determine 
the binding modes or occlusion site of the proteins as described 
earlier [17]. It corresponded to the [nt] po i y (dT)/ [Protein] value at 
the point of saturation of quenching of fluorescence. During 
titration, solutions were added from concentrated samples and 
correction for dilution was made as required. All fluorescence 



measurements were performed with Jasco spectrofluorimeter 
FP6500 (Japan) using a quartz cuvette of 1 cm path length at 
room temperature. 

Western Blotting and I mmu no-detection 

The purified A114779 protein was used to raise specific 
polyclonal antibody (anti-A114779 antibody) in rabbit. Proteins 
were extracted from three-day-old wild type and recombinant 
Anabaena cultures in Laemmli's buffer, separated by 14% SDS- 
PAGE followed by electroblotting on to nitrocellulose membrane. 
Immunodetection was carried out using the 1:5000 dilution of 
anti-A114779 antibody, followed by secondary anti-rabbit IgG 
antibody, coupled to alkaline phosphatase and colour development 
using NBT-BCIP. 

Generation of Recombinant Anabaena Strains 

The 0.55 kb Ndel-BamHI fragment from pKTall4779 was ligated 
to pFPN vector (Table 2) [18] at the same restriction sites, 
resulting in the plasmid construct, pF?Nall4779. The 0.81 kb SaK- 
Xmal fragment from p¥?Nall4779 was ligated to pAM1956 vector 
(Table 2) [19] digested with the identical restriction enzymes. The 
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Figure 4. Relative quenching of fluorescence of native purified Anabaena SSB-like proteins and EcoSSB as a function of ssDNA 
concentration. (A-C) Quenching of fluorescence in 20 mM NaCI as a function of poly(dT) concentration of (A) Alr0088, (B) Alr7579 and (C) AII4779 
and purified EcoSSB (commercially available, Sigma) proteins represented as relative fluorescence, considering the observed fluorescence in the 
absence of any poly(dT) as 1 00%. The horizontal line designates the point on the graph wherein relative fluorescence is 50% and the corresponding 
vertical line indicates the concentration of poly(dT) at which it is achieved. Reciprocal of this concentration corresponds to the binding constant of 
the protein for poly(dT). (D) Percent fluorescence quenching of Alr0088, Alr7579 and AII4779 proteins as a function of molar ratio of M13ssDNA and 
protein in the presence of 20 mM or 100 mM NaCI. The fluorescence quenching in the absence of M13ssDNA is considered as 0%. 
doi:10.1371/journal.pone.0093592.g004 



resulting construct was designated as pAMall4779 (Table 2). In 
this construct, the gfpmut2 gene (coding for Green Fluorescent 
Protein, GFP) is co-transcribed with the upstream all4779 gene 
from the 7p sbA1 promoter, but the two transcripts are translated 
independently as described earlier [20]. Recombinant Anabaena 
strain overexpressing AU4779 protein (Anall4779 + ) (Table 3) was 
generated by introducing the plasmid pAMall4779 into Anabaena 
by conjugation as described earlier [20], and repeated selection on 
BG-11 agar plates containing 17 mM NaN0 3 (BG-11, N + ) and 
Ne025, till completely segregated cells, uniformly expressing GFP, 
were obtained. 



Results and Discussion 

Bio-informatic Analysis of Alr0088, Alr7579 and AII4779 
Proteins 

The ah'0088, alr7579 and all4779 genes respectively encode 1 19, 
127 and 182 amino acid long polypeptides (Figure 1A) with an 
estimated molecular mass of 13, 14 and 20 kDa. The prokaryotic 
SSBs are generally about 160-180 amino acids long, having a 
molecular mass of 17-18 kDa, except for SsbB of naturally 
transformable bacteria, which in case of BsSsbB is 113 amino acids 
long [11]. The SsbA protein of B. subtiEs is 172 amino acids long, 
similar in size to EcoSSB [10]. Among the naturally non- 
transformable bacteria, the smallest known bacterial SSB are 
from the thermophilic bacteria, Tliermotoga maritima (TmaSSB) and 
T. neapolitana (TneSSB) consisting of 141 and 142 amino acids 
respectively, having a single OB-fold domain and a C-terminal 
domain with the conserved DEPPF terminal amino acids [21]. 
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Figure 5. Quenching of fluorescence of native purified Anabaena SSB-like proteins compared with that of EcoSSB. The quenching of 
fluorescence of (A) Alr0088, (B) Alr7579, (C) AII4779 and (D) EcoSSB in the presence of 20 mM or 100 mM NaCI was expressed as a ratio of change in 
fluorescence (AF) and initial fluorescence (F,). The (AF/F,) was expressed as a function of ratio of concentrations of poly(dT) and protein. The 
horizontal lines indicate the point of saturation and the vertical lines drawn from the point of saturation indicate the probable length of ssDNA bound 
by one molecular unit of the protein. 
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Among eukaryotes, the /ZmitSSB is 133 amino acids long and does 
not have the region corresponding to l/3 rd of the C-terminal 
region of EcoSSB [22]. Amino acid sequence analysis using 
Conserved Domain Database (CDD) [9] revealed the presence of 
a putative ssDNA-binding OB-fold domain and dimer/tetramer 
interface within N-terminal half in all three proteins similar to that 
in E. coli SSB (Figure 1A). A114779 additionally had a long C- 
terminal region similar to that observed for EcoSSB which 
comprised of a proline-rich region (19 residues) with two glycine 
residues, as compared to EcoSSB which is glycine rich (21 
residues) and has 8 prolines in the corresponding region 
(Figure IB). While multiple glycine residues allow flexibility in 
structure, multiple proline residues provide rigidity and kinks in 
the structure and thus no ordered structure results in gly-rich or 
pro-rich regions [23]. The proline-rich region of AU4779 would 
also separate the positively charged N-terminal and the negatively 



charged C-terminal regions, similar to that in EcoSSB [5] . The N- 
terminal region of A114779 exhibited 26% identical and 48% 
similar amino acid residues and a nearly identical acidic tail 
compared to EcoSSB (Figure IB). In spite of having an N-terminal 
OB-fold, P/G rich region and a C-terminal acidic tail, the low 
homology of A114779 to other known bacterial SSB proteins may 
possibly account for it not being annotated earlier as SSB-like 
protein in the genome database of Anabaena 7120, unlike Alr0088 
and Alr7579 which show a greater homology than A114779 in the 
OB-fold region. 

Biochemical Characterisation of Anabaena Alr0088, 
Alr7579 and AII4779 Proteins 

The Anabaena Alr0088, Alr7579 and A114779 proteins overex- 
pressed in E. coli BL21(pLysS) cells were purified to near 
homogeneity using Ni-NTA affinity chromatography (Figure 2A). 
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Figure 6. Construction of recombinant Anabaena strain overexpressing AII4779 protein. (A) Schematic diagram of the plasmid construct, 
pAMall4779 used for overexpression of AII4779 protein in Anabaena. The different restriction enzymes used for cloning are indicated. (B) Fluorescence 
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strain, Anall4779 + , grown for 3 days in BG-1 1, N~ media. (C) Protein extracts from Anabaena 7120 (lane 1) and Ana//4779 + (lane 2) were separated by 
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reacting AII4779 protein is indicated by an arrow. Equal loading controls are shown below the blot. Other details were as described in legend to 
Figure 2. (D) Growth profile of wild type Anabaena 7120 and recombinant Anabaena strain, Anall4779 + and AnpAM under nitrogen-fixing conditions 
over a period of 7 days. Growth was measured in terms of increase in chlorophyll a content. Recombinant strains were grown in presence of 
neomycin while wild type was grown without antibiotic. 
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Presence of dimer and tetramer interfaces in the amino acid 
sequence (Figure 1A) suggested possibility of formation of multi- 
mers by the protein. Alr0088 and Alr7579 were eluted in two 
distinct fractions and A114779 in a single fraction (Figure 2B) upon 
separation by gel filtration chromatography using Superdex 
HR200. On the basis of elution profile of standard proteins on 
the same matrix, the molecular mass of the different fractions was 
predicted as 14.1 kDa and 25.7 kTJa for Alr0088, 14.5 kDa and 
26.3 kDa for Alr7579 and 20.2 kDa for A114779 (Figure 2B). This 
indicated dimerisation of Alr0088 and Alr7579 proteins as against 
only the monomeric form detected for A114779 protein. Higher 
molecular forms of these proteins were not detected even at higher 
protein concentrations (data not shown). This did not conform to 
the bioinformatic prediction for the three proteins which indicate 



the presence of dimeric and tetrameric interfaces (Figure 1A). 
Further probing of multimeric status was carried out by cross- 
linking the native proteins with glutaraldehyde followed by 
separation by SDS-PAGE. Upon cross-linking, the dimeric forms 
corresponding to 32 kDa for Alr0088 (Figure 3A), 34 kDa for 
Alr7579 (Figure 3B) and 41 kDa for A114779 (Figure 3C) were 
detected, with the levels of the dimeric form being lowest for 
A114779. This could be the reason for the inability to detect a 
higher molecular weight peak during gel-filtration chromatogra- 
phy for A114779 (Figure 2B). In the presence of Ml 3 ssDNA the 
levels of the dimeric 4 1 kDa form as well as a probable tetrameric 
form of ~82 kDa increased (Figure 3G). This suggested that 
A114779 attains the native multimeric conformation preferably in 
the presence of ssDNA. In case of Alr0088 and Alr7579, no effect 
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Figure 7. Effect of AII4779 overexpression on the survival and tolerance of Anabaena to DNA-damage inducing stresses. (A and B) 

Three day-old cultures were concentrated to 10 ug chla density mL _1 and exposed to 6 kGy of 6o Co y-irradiation or to 6 days of desiccation. (A) 
Survival was measured in terms of colony forming units immediately after irradiation (I) or desiccation (D) and compared with the respective 
unirradiated control (CI) or undesiccated control (CD). (B) The stressed and control cultures were washed, inoculated in fresh BG-11, N~, Neo 12 . 5 and 
allowed to recover under normal growth conditions for 7 days. Growth during post-irradiation/desiccation recovery was measured in terms of 
chlorophyll a content and expressed as percent of respective unirradiated/undesiccated controls. (C and D) Three-day-old cultures of recombinant 
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on the levels of the dimeric form or generation of tetrameric form 
was observed even with ssDNA (data not shown). In general, 
bacterial SSB proteins function as tetramers [1], with the 
exception of thermophilic group of organisms (Thermus spp.) 
and the radioresistant microbe Deinococcus radiodurans [24] which 
function as dimer. However, the protomers of SSB of these 
organisms are twice the size of E. colt SSB and contain two OB- 
folds per monomer [24]. 

The DNA binding ability of the SSB-like proteins was assessed 
by Elctrophoretic Mobility Shift Assay (EMSA) and fluorescence 
quenching techniques. Multiple shifts in the mobility of the 75-mer 
ss oligonucleotide was observed in the presence of Alr0088 
(Fig. 3D). Increase in concentration beyond 0.2 |xg Alr0088 did 



not result in any further shifts in mobility (data not shown). 
Alr7579 decreased the mobility of the 75-mer oligo only when 
used at very high concentrations of 1.2—1.4 ug (Fig. 3D). Presence 
of Alr7579 also resulted in detection of multiple bands differing in 
their mobility, but majority of the complex formed, even when low 
concentrations of 0.12 p.g of AU4779 was used, was detected near 
the well (Fig. 3D). Based on this, the binding efficiency for ssDNA 
seems to be maximum for A114779, followed by Alr0088 and the 
least for Alr7579. The binding affinity for each of these proteins 
for ssDNA was calculated by fluorescence quenching technique 
using poly(dT) as the ssDNA substrate. 

The relative fluorescence of native Anabaena proteins Alr0088, 
Alr7579 and A114779, was measured as a function of increasing 
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concentration of poly(dT) at 20 mM NaCl. The relative fluores- 
cence of (i) Alr0088 decreased to a maximum of 40% with 
-450 nM poly(dT) (Figure 4A), (ii) Alr7579 showed less than 20% 
decrease (Figure 4B), and (iii) A114779 up to 40% of the initial 
fluorescence, but at much lower concentrations (~35 nM) of 
poly(dT) (Figure 4C). Based on this, the binding constant, as an 
average of three independent experiments, was calculated as 
2.56±0.4xl0 6 M" 1 for Alr0088, 5.13±0.71 x 10 7 M" 1 for 
A114779 and 6.76xl0 7 M" 1 for EcoSSB (Figure 4C), which was 
comparable to that reported for EcoSSB (5.5±1.5xl0 7 M -1 ) [4]. 
In the absence of C-terminal acidic tail, the binding affinity for 
ssDNA has been shown to increase 10-fold in case of EcoSSB [4], 
as well as for HsmtSSB, which lacks the C-terminal tail, calculated 
as 4x 10 8 M -1 [25]. However, the reverse was found to be true in 
case of Anabaena 7120, with Alr0088 which lacks the acidic tail, 
having 10-fold lower binding affinity than A114779. This could be 
due to the additional absence of the P/ G-rich region as well in 
Alr0088. 

The inability of Alr7579 to bind poly(dT) raised questions on 
whether the OB-fold, responsible for binding ssDNA [1] is active 
in Alr7579. To test this, a larger ssDNA, such as M13 ssDNA was 
used as a substrate. A 60—70% quenching of the fluorescence of 
Alr7579 was observed with the 7 kb M13 ssDNA, the efficiency 
being higher at high NaCl concentration (Figure 4D), which allows 
formation of a more compact structure of ssDNA. The quenching 
of fluorescence of Alr7579 was not observed with thermally 
denatured Ml 3 ssDNA (data not shown). Ml 3 ssDNA is known to 
form secondary structures [26], which are disrupted at higher 
temperature. This suggested that Alr7579 may be recognising 
secondary structures formed with long ssDNA, rather than short 
stretches of linear ssDNA. Both Alr0088 and A114779 also bound 
Ml 3 ssDNA at high salt concentration, but with lower efficiency, 
the quenching of fluorescence being 40% and 18% respectively 
(Figure 4D). While low quenching of fluorescence of Alr0088 by 
Ml 3 ssDNA was observed at low NaCl (Figure 4D), indicating low 
level interactions, no such interaction was observed for A114779 
(Figure 4D). 

In general, SSB proteins interact with ssDNA in multiple 
binding modes, differing in the number of OB-folds which interact 
with the ssDNA. In the (SSB) 35 mode, approximately 35 
nucleotides of ssDNA interact with two subunits of the Ssb 
tetramer, while in (SSB) 65 mode, ~65 nucleotides of ssDNA wrap 
around all four subunits, which is more favoured at higher salt 
concentrations [25] . Based on the quenching of fluorescence (AF/ 
F;) of the three Anabaena proteins with poly(dT) at low (20 mM) 
NaCl and high (100 mM) NaCl concentrations, binding modes or 
occlusion size for each of the protein determined. A single binding 
mode of 54— 55 nucleotides was estimated for Alr0088, which was 
independent of NaCl concentration (Figure 5A). No significant 
quenching of fluorescence of Alr7579 was observed at low or high 
concentrations of NaCl (Figure 5B), while two binding modes 
dependent on NaCl concentration was observed for A114779 
(Figure 5C). The binding size was found to be 35.5 nucleotides at 
20 mM NaCl and 65.9 nucleotides at 100 mM NaCl for A114779 
(Figure 5C), and 32.5 and 70 nucleotides at 20 mM and 100 mM 
NaCl respectively for EcoSSB under identical experimental 
conditions, comparable to the (SSB) 33 and (SSB) 65 modes of 
binding, at low and high salt concentrations respectively, shown 
for EcoSSB [1,27]. Since, (SSB) fi5 mode of binding requires the 
binding of ssDNA to the tetrameric form of SSB [27], and 
molecular form corresponding to a tetramer of A114779 was very 
low, the quenching of fluorescence of A114779 at higher NaCl was 
lower than that at lower NaCl (Figure 5C), as well as that observed 
with EcoSSB (Figure 5D). 



Thus, though all the three proteins i.e. Alr0088, Alr7579 and 
A114779 bind ssDNA, their binding affinity and modes of binding 
are distinct and among these, the binding ability as well as binding 
modes of A114779 was quite similar to other known bacterial SSBs. 
The presence of (P/ G)-rich spacer and a near identical C-terminal 
acidic tail, suggests that in Anabaena 7120, A114779 may also be 
performing in vivo functions similar to those carried out by the 
typical bacterial SSB proteins. Since, overexpression of bacterial 
SSBs are known to influence the repair of stress induced DNA 
damage [28,29], thereby enhancing tolerance to DNA damaging 
stresses, a similar role for A114779 was assessed in Anabaena 7120. 

Physiological Role of AII4779 Protein in Anabaena 7120 

The A114779 protein was overexpressed in trans from the plasmid 
pAMall4779 (Table 2, Figure 6A) in the recombinant Anabaena 
strain, Anall4779 + (Table 3). Due to growth under continuous 
illumination, the expression of the A114779 protein from the light- 
inducible p.sbAl promoter was expected to be constitutive. Co- 
overexpression of the Green Fluorescent Protein (GFP), coded by 
gfpmut2 in the pAMall4779 plasmid, provided a handy tool to 
distinguish the fully segregated recombinant Anall4779 + strain 
exhibiting green fluorescence, from the wild type Anabaena 7120 
which exhibited red fluorescence upon excitation with X i70 light 
(Figure 6B). It also ensured expression of the upstream gene. The 
overexpression of A114779 in Anall4779 + cells was indeed 
confirmed by immunodetection with anti-A114779 antibody 
(Figure 6C). Under normal growth conditions, the nitrogen-fixing 
cultures of Aaall4779 + grew marginally slower than the wild type 
Anabaena 7120 cultures, and at rates comparable to the recombi- 
nant Anabaena strain harbouring pAM1956 vector, AnpAM 
(Table 3) [30] (Figure 5D). This is possibly due to the presence 
of neomycin in the growth medium used for recombinant strain. 

The effect of overexpression of A114779 on the ability of 
Anabaena 7120 to tolerate DNA damage inducing stresses was 
analysed in response to two distinct types of DNA damages i.e. (i) 
y-irradiation and desiccation which cause single strand and double 
strand breaks, and (ii) UV-B and mitomycinC which cause 
formation of pyrimidine dimers and DNA adducts respectively. 
The empty vector control recombinant strain, AnpAM exhibited 
about 55% and 44% survival upon exposure to 6 kGy of 6o Co y- 
rays or 6 days of desiccation respectively (Figure 7 A). Upon 
constitutive overexpression of A114779 in Anall4779 + cells, the 
survival increased to about 60% after exposure to 6 kGy of y-rays 
and 70% after 6 days of desiccation (Figure 7A). The recovery of 
irradiated cultures of Anabaena, measured in terms of chlorophyll a 
content increased from about 50% to over 100% in cells 
overexpressing A114779 (Figure 7B) suggesting better tolerance to 
radiation. Such correlation was however, not found in post 
desiccation recovery, (Figure 7B), possibly due to additional 
stresses, such as osmotic stress experienced during desiccation 
followed by rehydration of these cells. Of the other two SSB-like 
proteins of Anabaena, overexpression of Alr0088 decreased the 
radiation tolerance of Anabaena, while that of Alr7579 had no effect 
[14]. This suggested that A114779 is the typical bacterial SSB of 
Anabaena, involved in the repair of single and double strand breaks 
in DNA, possibly as part of a larger DNA repair complex, which is 
yet to be identified. 

Overexpression of A114779 was also beneficial in protection 
against stresses which caused formation of DNA adducts. The 
survival of AnpAM was about 75% and 11.8% respectively upon 
exposure to 0.75 and 1.5 kj m 2 of UV-B irradiation, which 
increased to 77% and 50% respectively in Anall4779 + cells, 
overexpressing A114779 protein (Figure 7C). The beneficial effect 
of the constitutive overexpression of A114779 was more pro- 
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nounced when exposed to higher doses (1.5 kj m 2 ) of UV-B 
(Figure 7C), while at lower dose of (0.75 kj m" 2 ), that of Alr0088 
was more benficial [14]. AnpAM cells exhibited 50% survival 
upon exposure to 4 ug mitomycinC mL 1 for 30 min, which 
increased to 85% upon overexpression of A114779 in Anall4779 + 
cells (Figure 7D), comparable to that observed upon overexpres- 
sion of Alr7579, but lower than that with Alr0088 [14]. Thus, the 
presence of high levels of A114779 in Anabaena possibly decreased 
the net damage to DNA, both in terms of single and double 
stranded breaks as well as formation of DNA adducts, possibly by 
efficient repair of the damaged DNA. Overexpression of SSB has 
been shown to be beneficial by aiding DNA repair in E. coli cells 
[28]. 

Thus, A114779 is the major typical bacterial SSB of Anabaena 
7120 in terms of structural domains, binding to ssDNA and 
physiological role in DNA repair. The genes coding for the two 
atypical truncated annotated SSB proteins, Alr0088 and Alr7579 
may have arisen due to gene duplication as suggested for PriB, a 
dimeric protein with only OB-fold and capable of binding ssDNA 
[31] and may be involved in other functions such as replication 
and recombination. The unicellular cyanobacterium, Synechocystis 
PCC6803 has been shown to be naturally transformable with 
possible involvement of competence proteins, ComA (Slr0197) 
[32] and ComF (Slr0388) [33]. The orthologs of these genes are 
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also found in Anabaena 7120, annotated as all3087 and alr2926 
respectively (http:// genome.microbedb.jp/cyanobase/ Anabaena), 
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naturally G-terminal truncated BsSsbB, is involved in competence 
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the site of interaction with DNA repair proteins for E. coli SSB [3]. 
Owing to the presence of an acidic tail, A114779 upon overex- 
pression offers better protection from DNA-damage when 
subjected to different DNA-damage-inducing stresses. Based on 
data presented, we propose that A114779 be re-annotated as the 
gene coding for typical single stranded DNA binding protein (SSB) 
and the corresponding ORF be annotated as the ssb gene of 
Anabaena 7120. 
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